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Serial Namberi Qy/ZSjOVfi ff?" g A| T P D isSP^S 
Q Changed a file from non-ASCII to ASCII ' ,M * C H gfjf.^* -GOB* 

Changed4he margins in cases wheretne sequence text was "wrapped.' downd^l^ejct line. ^ 
Edited a format error in the Current Application Data section, specifically: 



V 

Staff 




Edited the Current Application Data section with the aduaJ current number. The number inputted by the 
applicant was □ the prior application data; or □ other _ , • 

| — | Added the mandatory heading and subheadings for 'Current Application Data*. 

(—) Edited the 'Number of Sequences" field. The applicant spelled out a number instead of using an integer, 
r— | Changed the spelling ol a mandatory field (the headings ofsubheadings), specific**/:. 



| — | Corrected the SEQ ID NO when obviously incorrect. The sequence gumbers lhat were edited 
r— ] Inserted or corrected a nucleic number at the end of a nucleic line. SEQ ID NO'S edited: 



were: 



Corrected subheading placement. All responses must be on the same line as each subheading. If the 
applicant placed a response below the subheading, this was moved to Us appropnate place. 




Inserted colons after headings/subheadings. Headings edited included: 
Deleted extra, invalid, headings used by an applicant, specifically: 

„ ■ * . ^| 

Deleted: □ non-ASCII "garbage" at the beginning/end ol files: □ secretary inilials/filenarn^^^nd of file 
□ page numbers throughout text; □ other invalid text, such as _ • 

Inserted mandatory headings, specifically: _ £2-2x>~7 2* f ^ = 

Corrected an obvious error in the response, specifically: 



Edited .identifiers where upper case is used but lower case is required, or vice versa. 
Con-ecTod an error in the Number ol Sequences field, specifically: 

A "Hard Page Break" code was inserted by the applicant. All occurrences had to be deleted. 

Deleted ending* stop codon in amino acid sequences : andadjusted the -(AJLength:" field accordingly (error 
due to a Palentln bug). Sequences corrected: ■; — : — . — _ - r— — 

Other: - — . 



m&mm er: The above corrections must be^m ariic ated to the applicant in the first Ottlce, 



Action. DO NOT send a copy orthis-facm, 
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RAW SEQUENCE LISTING DATE: 01/12/2001 

PATENT APPLICATION: US/09/12 5 , 0 3 IB ' TIME: 09:23:41 

Input Set : A:\Pto.amc 

Output Set: N:\CEF3\01112001\I125031B.raw 

3 <1.U)> APPLICANT: LONG ACRE - ANDRE, SHIRLEY 
-1 ROTH ,• CHARLES 

5 BARNWELL , JOHN 

6 MENDIS, KAMTNT 

7 NATO , FA. Rl DAB A HO 

9 <l-20> TITLE OF INVENTION: RECOMBINANT PROTEIN CONTAINING A C - TERMINAL FRAGMENT OF 
10 PLASMODIUM MSP-1 

12 <130> FILE REFERENCE: 0 660 - 013 9- OXPCT 

.14 <140> CURRENT APPLICATION NUMBER: 09/12 5, 031B 

15 <141> CURRENT FILING DATE: 199 9-03-10 

17 <150> PRIOR APPLICATION NUMBER: PCT/FR9 7/00290 

18 <151> PRIOR FILING DATE: 1997-02-14 

20 <150> PRTOR APPLICATION NUMBER: FR96/01822 
2.1 <151> PRIOR FILING DATE: 1996-02-14 
2 3 <16 0> NUMBER OF SEQ ID NOS : 15 

2 5 <17 0> SOFTWARE: Pa tent In Ver. 2.1 

27 <210> SEQ ID NO: 1 

28 <211> LENGTH: 291 

29 <212> TYPE: ON A 

3 0 <213> ORGANISM: Artificial Sequence 
3 2 <2 20> FEATURE: 

33 <223> OTHER INFORMATION: Description of Artificial Sequence: SYNTHETIC 

3 5 <220> FEATURE: 

3 6 <221> NAME/KEY: CDS 

37 <222> LOCATION: (1)..(291) 

3 9 <4 00> SEQUENCE: 1 

4 0 gaa ttc aac a Lc teg cag cac can tgc gtg aaa aaa caa tgt ccc yag 48 
4.1 G.lu Phe Asn lie Se.r Gl.n His Gin Cys Val Lys Lys Gin Cys Pro Glu 

4 2 1 5 10 15 

44 aac tct ggc tgt ttc ay a cac ttg gac gag aga gay yag tgt aaa tgt 96. 

4 5 Asn Ser Gly Cys Phe Arg His Leu Asp Glu Arg Glu G.lu Cys Lys Cys 

4 6 20 25 30 

4 8 ctg ctg aac tac aaa cag gag ggc gac aag tyc gtg gag aac ccc aac 14 4 

4 9 Leu Leu Asn Tyr Lys Gin Glu Gly Asp Lys Cys Val Glu Asn Pro Asn 
50 3 5 4 0 4 5 

52 coy acc tgt aac gay aac aac ggc ggc tgt gac yea gac gee aaa tgc 192 

5 3 Pro Thr Cys Asn Glu Asn Asn Gly Gly Cys Asp Ala Asp Ala Lys Cys 
54 5 0 55 60 

56 acc gag gay gac tcy ggc age aac ggc aag aaa ate acg tgt gag tgt 240 
5 7 Thr Glu Glu Asp Ser Gly Ser Asn Gly Lys Lys lie Thr Cys Gl.u Cys 
5B 65 70 75 80 

60 acc aaa ccc gac teg tac ccg ctg ttc gac ggc ate ttc tgc age taa 288 

61 Thr Lys Pro Asp Ser Tyr Pro Leu Phe Asp Gly He Phe Cys Ser 

62 ^ 85 90 95 

64 taa * 291 

68 <210> SEQ ID NO: 2 

69 <211> LENGTH: 95 



f ile://C:\Crf3\Outhold\VsrI 1 2503 1 B.htm 
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RAW SEQUENCE LISTING DATE: 01/12/2001 

PATENT APPLICATION: US/09/12 5, 03 IB TIME: 09:23:41 



Input: Set : A:\Pto.amc 

Output Set: N:\CRF3\01112001\I125031B.raw 

70 <212> TYPE: PKT 

7.1 <213> ORGANISM: Artificial Sequence 
7 3 <220> FEATURE: 

74 <22 3> OTHER INFORMATION: Description of Artificial Sequence: SYNTHETIC 
76 <400> SEQUENCE: 2 



77 


Glu 


Phe 


Asn 


I le 


Ser 


G 1 n 


His 


Gin 


Cys 


Val 


Lys 


Lys 


Gin 


Cys 


Pro 


Glu 


78 


1 








5 










10 










15 




79 


Asn 


Ser 


Gly 


Cys 


Phe 


Arg 


His 


Leu 


Asp 


Glu 


Arg 


Glu 


Glu 


Cys 


Lys 


Cys 


80 








20 










25 










30 






81 


Leu 


Leu 


Asn' 


Tyr 


Lys 


Gin 


Glu 


Gly 


Asp. 


Lys 


Cys 


Val 


Glu 


Asn 


Pro 


Asn 


82 






35 










40 










45 








83 


Pro 


Thr 


Cys 


Asn 


Glu 


ASM 


Asn 


Gly 


Gly 


Cys 


Asp 


Ala 


Asp 


Ala 


Lys 


Cys 


8-1 




50 










55 










60 










8 5 


Thr 


Glu 


Glu 


Asp 


Ser 


Gly 


Ser 


Asn 


Gly 


Lys 


Lys 


He 


Thr 


Cys 


Glu 


Cys 


86 


65 










70 










75 










80 


87 

88 


Thr 


Lys 


Pro 


Asp 


Ser 
85 


Tyr 


Pro 


Leu 


Phe 


Asp 
90 


Gly 


He 


Phe 


Cys 


Ser 
95 





92 <210> SEQ ID NO: 3 
9 3 <211> LENGTH : 27 9 

94 <212> TYPE: DNA 

95 <213> ORGANISM: Plasmodium falciparum 

9 7 <4 00> SEQUENCE: 3 

98 aacatttcac aacaccaatg cgtaaaaaaa caatgtccag aaaattctgg atgtttcaga 60 

99 caLttagatg aaagagaaga atgtaaatgt ttattaaatt acaaacaaga agglgataaa 120 

100 tgtgttgaaa atccaaatcc tacttgtaac gaaaataatg gtggatgtga tgeaqatgee 180 
.1.01. aaatgLaccg aagaagattc aggtagcaac ggaaagaaaa tcacatgtga atgtactaaa 24 0 
102 cctgattctt atccactttt cgatggtatt ttctgcagt '279 

10 5 <210> SEQ ID NO: 4 

106 <21I> LENGTH: 3 54 

107 <212> TYPE: DNA 

.108 <213> ORGANISM: Artificial Sequence 
HO <220> FEATURE: 

111 <2 23> OTHER INFORMATION: Description of Artificial Sequence : SYNTHETIC 

113 <2 2 0> FEATURE: 

114 <221> NAME/KEY: CDS 

115 <222> LOCATION : (!)..( 354) 

117 <400> SEQUENCE: 4 

118 gaa the aac ate Leg cag cac 

119 Glu Phe Asn He Ser Gin His 

120 1 5 

122 aac tct ggc tgt ttc aga cac 

123 Asn Ser Gly Cys Phe Arg His 

124 20 

126 ctg cLg aac tac aaa cag gag 

127 Leu Leu Asn Tyr Lys Gin Glu 

128 3 5 

130 ccg acc tgt aac gag aac aac 

131 Pro Thr Cys Asn Glu Asn Asn 

132 50 " 55 



caa tgc gtg aaa aaa caa tgt ccc gag 4 8 
Gin Cys Val Lys Lys Gin Cys Pro Glu 

10 :i 5 

ttg gac gag aga gag gag tgt aaa tgt 96 
Leu Asp Glu Arg Glu Glu Cys Lys Cys 

25 30 
ggc gac aag tgc gtg gag aac ccc aac 14 4 
Gly Asp Lys Cys Val Glu Asn Pro Asn 

40 45 
ggc ggc tgt gac gca gac gec aaa tgc 192 
Gly Gly Cys Asp Ala Asp Ala Lys Cys 
60 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/12 5 , 03 IB 



DATE: 01/12/2001 
TIME: 09:2.1:4:1 



Input Set : A:\Pto.amc 

Output Set: N:\CRF3\01112001\I125031B.raw 



134 
135 

13 6 
138 
139 
1-10 
.142 

14 3 
14 4 
146 

14 7 
148 
151 
152 
153 
154 
156 
157 

15 9 
160 
161 

16 2 
163 
16 4 
165 
166 
16 7 
16 8 

16 9 
170 
171 
172 

17 3 
174 
175 

17 9 
180 
181 
.182 
184 
1-8 5 
186 
187 

18 8 
189 
190 
193 
194 
195 



240 



288 



336 



354 



acc gag gag qac teg ggc age aac ggc aag aaa ate acg tgt gag tgt 
Thr Clu Glu Asp Ser Gly Ser Asn Gly Lys Lys lie Thr Cys Glu Cys 
65 70 75 ■ 80 

acc aaa ccc gac teg toe ceg ctcj ttc gac ggc ate ttc tgc age tec 
Thr Lys Pro Asp Ser Tyr Pro Leu Phe Asp Gly lie Phe Cys Ser Ser 

85 90 95 

tct aac ttc ttg ggc ate teg ttc ttg ttg ate etc' atg ttg ate ttg 
Ser Asa Phe Leu Gly lie Ser Phe Leu Leu J.le Leu Met Leu lie Leu 

100 105 .110 

tdc age ttc att taa taa 
Tyr Ser Phe He 
I if. 

<210> SEQ ID NO: 5 
<211> LENGTH: 116 
<212> TYPE: PKT 

<213> ORGAN TSM : Artificial Sequence 
<220> FEATURE: ' 

<223> OTHER INFORMATION: Description of Artificial Sequence : SYNTHETIC 
<4 00> SEQUENCE: 5 

Glu Phe Asn He Ser Gin His Gin Cys Val Lys Lys Gin Cys Pro Glu 

1 5 10 15 

Asn Ser Gly Cys Phe Arg His Leu Asp Glu Arg Glu Glu Cys Lys Cys 



M IT 1001 



20 



25 



30 



Leu Leu Asn Tyr Lys Gin Glu Gly Asp Lys Cys Val Glu Asn Pro Asn 



4 0 



45 



Pro Thr Cys Asn Glu Asn Asn Gly Gly Cys Asp Ala Asp Ala Lys C; : 



50 



55 



60 



Thr Glu Glu Asp Ser Gly Ser Asn Gly Lys Lys He Thr Cys Glu Cys 



65 



70 



75 • 



80 



Thr Lys Pro Asp Ser Tyr Pro Leu Phe Asp Gly He Phe Cys Ser Ser 



85 



90 



95 



Ser Asn Phe Leu Gly He Ser Phe Leu Leu Tie Leu Met Leu He Leu 



105 



110 



100 

Tyr Ser Phe He 
115 

<210> SEQ TD NO: 6 
<211> LENGTH; 342 
<212> TYPE: DMA 

<213> ORGANISM: Plasmodium falciparum 
<4 00> SEQUENCE: 6 

aacatttcac aaeaccaatg eg ta aaa aaa eaatgtccag aaaattctgg atgtttcaga 6 0 
catttagatg aaagagaaga atgtaaatgt ttattaaatt acaaacaaga aggtgataaa 120 
tgtgttgaaa atccaaatcc tacttgtaac gaaaataatg gtggatgtga tgcagatgee 180 
aaatgtaccg aagaagattc aggtagcaae ggaaagaaaa tcacatgtga atgtaetaaa 24 0 
cctgattctt atccactttt cgatggtatt ttctgcagtt cctctaactt cttaggaata 300 
tcattcttat taa tac teat gttaatatta tacagtttca tt 34 2 

<210> SEQ ID NO: 7 
<211> LEMGTH: 38 7 
<2I2> TYPE: DNA 
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RAW SEQUENCE LISTING DATE: 01/12/2001 

PATENT APPLICATION : US/09/1 2 5 , 03 IB TIME : 09:23:41 



Input Set : A:\Pto.amc 

Output Set: N:\CRF3\01112001\I12503lB.raw 



1 9 6 


<213> ORGANISM: 


Plasmodium 


falciparum 
















TOO 


<220> FEATURE: 




























199 


<221> NAME/KEY: 


CUS 


























200 


<222> LOCATION : 


( 1 ) 


. . (387) 






















'> n *"> 


<4 00> SEQUENCE: 


7 


























2 0 3 


a tg 


a a y 


gey 


eta 


etc 


It t 


tty 


ttc 


to t 


ttc 


at L 


t tt 


ttc 


yt t 


ace 


a<-ja 


4 8 


2 04 


Met 


Lys 


Ala 


Leu 


Leu 


Ph e 


Leu 


Pits 


Ser 


Phe 


1 .1 e 


Phe 


Phe 


val 


Th r 


Lys 




20 5 


l 








5 










10 










1 5 






207 


tgt 


ca a 


tgt 


gaa 


a ca 


gaa 


agt 


tat 


aag 


cag 


ctt 


gta 


gee 


aac 


gtg 


gac 


9 6 


208 


Cys 


Gin 


Cys 


Glu 


Thr 


Glu 


Ser 


'J'y r 


Lys 


Gin 


Leu 


Val 


Ala 


Asn 


va i 


Asp 




209 








2 0 










25 










30 








2 1 1 


yaa 


ttc 


aac 


ate 


teg 


cay 


cac 


caa 


tyc 


gty 


aaa 


aaa 


caa 


tgt 


eee 


gag 


1 4 4 


2 "I 2 


Glu 


Phe 


Asn 


lie 


Ser 


Gin 


His 


Gin 


Cys 


Val 


.Lys 


Lys 


Gin 


Cys 


Pro 


G.l u 




213 






. 35 










40 










4 5 










215 


aac 


tct 


yqc 


tgt 


ttc 


aga 


cac 


ttg 


yac 


yay 


ay a 


gag 


gag 


tgt 


aaa 


tgt 


19 2 


216 


Asn 


Ser 


Gly 


Cys 


Phe 


Ary 


His 


Leu 


Asp 


Glu 


Arg 


Glu 


Glu 


Cys 


Lys 


Cys 




217 




50 










55 










60 












2 19 


ctg 


cty 


aac 


tac 


aaa 


eag 


gag 


ggc 


gac 


aag 


tgc 


gtg 


gag 


aac 


eee 


aac 


24 0 


220 


Leu 


Leu 


Asn 


Tyr 


Lys 


Gin 


Glu 


ciy 


Asp 


Lys 


Cys 


V a 1 


Glu 


Asn 


Pro 


Asn 




221 


65 










70 










7 5 










8 0 




223 


ccy 


acc 


tgt 


aac 


yag 


aac 


aac 




ggc 


tgt 


gac 


yea 


gac 


g e c 


aaa 


tgc 


2 88 


224 


Pro 


Thr 


Cys 


Asn 


Glu 


Asn 


Asn 


G.ly 


Gly 


cys 


Asp 


Ala 


Asp 


Ala 


Ly s 


Cys 




225 










85 










90 










95 






2 2 7 


HOC 


gay 


gag 


gac 


teg 


ggc 


aye 


aac 


ggg 


aag 


aaa 


a tc 


a eg 


tgt 


gag 


ty L 


3 3 6 


228 


Thr 


Glu 


Glu 


Asp 


Ser 


Gly 


Ser 


Asn 


G.ly 


Lys 


Ly s 


lie 


Thr 


Cvs 


Glu 


Cys 




229 








100 










10 5 










110 








231 


acc 


aaa 


eee 


g a c 


teg 


tac 


ccg 


ctg 


ttc 


gac 


ggc 


ate 


ttc 


tgc 


aye 


La a 


3 8 4 


232 


Thr 


Lys 


Pro 


Asp 


Ser 


Tyr 


Pro 


Leu 


Phe 


Asp 


Gly 


lie 


Phe 


Cys 


Ser 






23 3 






115 










120 










125 










235 


tati 
































38 7 


2 3 9 


<210> SEQ ID NO 


8 


























240 


<211> LENGTH: 11 


17 


























241 


<212> TYPE: 


PRT 




























242 


<213> ORGANISM: 


Plasmodium falciparum 
















24 4 


<4 0 0> SEQUENCE: 


8 


























245 


wet 


Lys 


Ala 


Leu 


Leu 


Phe 


Leu 


Phe 


Ser 


Phe 


lie 


Phe 


Phe 


Va 1 


Thr 


Lys 




24 6 


1 








5 










10 










15 






247 


Cys 


Gin 


Cys 


Glu 


Thr 


Glu 


Ser 


Tyr 


Lys 


Gin 


Leu 


val 


Ala 


Asn 


Val 


Asp 




248 








2 0 










2 5 










30 








24 9 


Glu 


Phe 


Asa 


lie 


Ser 


Gin 


His 


Gin 


Cys 


Va I 


Lys 


Lys 


Gin 


Cys 


Pro 


Glu 




250 






35 










4 0 










4 5 










251 


Asn 


Ser 


Gly 


Cys 


Phe 


Arg 


His 


Leu 


Asp 


Glu 


Arg 


Glu 


Glu 


Cys 


Lys 


Cys 




252 




50 










55 










60 












253 


Leu 


Leu 


Asn 


Tyr 


Lys 


Gin 


Glu 


Gly 


Asp 


Lys 


Cys 


Val 


Glu 


Asn 


Pro 


Asn 




2 54 


65 










70 










75 










80 




255 


Pro 


Thr 


Cys 


asn 


Glu 


Asn 


Asn 


Gly 


Gly 


Cys 


Asp 


Ala 


Asp 


Ala 


Lys 


Cys 




25G 










G5 










90 










y 5 






2 7 


Thr 


Glu 


Glu 


Asp 


Ser 


Gly 


Ser 


Asn 


Gly 


Lys 


Lys 


lie 


Thr 


Cys 


Glu 


Cys 





2'. 3 • 100 105 1.10 
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RAW SEQUENCE LISTING DATL : 01/12/2001 

PAT KMT APPLICATION : US/09/12 5 , 03 IB TIME : 09:23:41 

Input Set : A:\Pto.amc 

Output Set: N:\CRF3\01112001\I125031B.raw 

2 59 Thr Lys Pro Asp Ser Tyr Pro Leu Phe Asp Gly lie Phc Cys Ser 
26 0 115 120 125 

264 <2.'i0> SEQ ID NO: 9 

26 5 <211> Lh'NGTH: 3 30 

266 <212> TYPE!: DNA 

267 <2.13> ORGANISM: Plasmodium falciparum 

269 <220> FEATURE: 

270 <221> NAME/KEY: CDS 

271 <222> LOCATION: (1)..(3 

27 3 <4 00> SEQUENCE: 9 

274 gaa aca gaa agt tat aag 

275 Glu Thr Glu Ser Tyr Lys 

276 .1 5 

278 ate teg cag cac oaa tgc 

279 lie Ser Gin His Gin Cys 
200 20 

282 tgt ttc aga cac ttg gac 

283 Cys Phe Arg His Leu Asp 

284 3 5 

286 lac aaa cag gag ggc gac 

287 Tyr Lys Gin Glu Gly Asp 
2 88 5 0 

290 aac gag aac aac ggc ggc 

291 Asn Glu Asn Asn Gly Gly 

292 65 70 

294 gac teg ggc age aac ggc 

295 Asp Ser Gly Ser Asn Gly 

29 6 8 5 

298 gac teg tac ccg ctg ttc 

299 Asp Ser Tyr Pro Leu Phe 
W--> 300 100 

303 <210> SEQ TD NO: 10 

304 <211> LENGTH : 108 

305 <212> TYPE : PRT 

30 6 <213> ORGANISM: Plasmorii urn falciparum 



308 


<4 00> SEQUENCE: 


10 
























309 


Glu 


Thr 


Glu 


Ser 


Tyr 


Lys 


GIjj 


Len 


Val 


Ala 


Asn 


Vol 


Asp 


Glu 


Phe 


Asn 


310 


1 








5 










10 










15 




3i:i 


lie 


Ser 


Gin 


lij.S 


Gin 


Cys 


Vai 


Lys 


Lys 


Gin 


Cys 


Pro 


Glu 


Asn 


Set- 


Gly 


312 








20 










25 










30 






313 


Cys 


Phe 


Arg 


His 


Leu 


Asp 


Glu 


Arg 


Glu 


Glu 


Cys 


Lys 


Cys 


Leu 


Leu 


Asn 


314 






35 










40 










4 5 








315 


Tyr 


Lys 


Gin 


Glu 


Gly 


Asp 


Lys 


Cys 


Val 


Glu 


Asn 


Pro 


Asn 


Pro 


Thr 


Cys 


316 




50 










55 










. 60 










317 


Asn 


Glu 


Asn 


Asn 


Gly 


Gly 


Cys 


Asp 


Ala 


Asp 


Ala 


Lys 


Cys 


Thr 


Glu 


Glu 


318 


65 










70 










75 










80 


319 


Asp 


Ser 


Gly 


Ser 


Asn 


Gly 


Lys 


Lys 


lie 


Thr 


Cys 


Glu 


Cys 


Thr 


Lys 


Pro 


320 










85 










90 










95 




321 


ASp 


Se r 


Tyr 


Pro 


Leu 


Phe 


ASp 


Gly 


lie 


Phe 


Cys 


Ser 











30) 



cag ctt ita gec aac gtg gac gaa ttc aac 48 

Glh Leu val Ala Asn Val Asp Glu Phe Asn 

10 15 

gtg aaa aaa caa tgt ccc gag aac tct ggc 96 

Val Lys Lys Gin Cys Pro Glu Asn Ser Gly 

25 30 

gag aga gag g-ag tgt aaa tgt ctg ctg aac 144 

Glu Arg Glu Glu Cys Lys Cys Leu Leu Asn 

4 0 4 5 

aag tgc gtg gag aac ccc aac ccg acc tgt 19 2 

Lys Cys Val Glu Asn Pro Asn Pro Thr Cys 

55 60 

tgt gac gca gac gec aaa tgc acc gag gag 24 0 

Cys Asp Ala Asp Ala Lys Cys Thr Glu Glu 
75 80 

aag aaa ate acg tgt gag tgt ace aaa ccc 288 

Lys Lys Tie Thr Cys Glu Cys Thr Lys Pro 

90 95 

gac ggc ate ttc tgc age taa taa 3 30 

Asp Gly He Phe Cys Ser 

105 110 
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VERIFICATION SUMMARY DATE: 01/12/2001 

PATENT APPLICATION: US/09/12 5 , 03 IB TIME : 09:23:42 

Input: Set : A:\Pto.amc 

Output Set: N:\CRF3\01112001\I125031B.raw 
L:300 M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 9 
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1641 



RAW SEQUENCE LISTING DATE: 01/08/2001 

PATFJNTAPPlJ.CAT.tON: US/09/12 5 , 0 3 IB TIME: 15:01:47 

Input Set : A:\660139.app 

Output Set: N:\CRF3\01082001\I125031B.raw 

3 <110> APPLICANT: LONGACRE- ANDRE, SHI RLE V fcl . t 

4 roth, charlhs Does Not Comply 

I MrZf L am?^t Corrected Diskette Needed 

6 MEHDIS, KAMI N I 

7 NATO , FARTDABANO 

9 <120> TITLE OF INVENTION : RECOMBINANT PROTEIN CONTAINING A C - TERMINAL FRAGMENT OF 
10 PLASMODIUM MSP-1 

12 <130> FILE REFERENCE: 0660 -0139 -OXPCT- 

14 <1 A 0 > CUR KENT A P P L I CAT ION N UMB BR : 09/125 , 031B 

15 <141> CURRENT FILING DATE: 1999 -03 -.10 

17 <150> PRIOR APPLICATION NUMBER: PCT/FR9 7/00290 

18 <151> PRIOR FILING DATE: 19 97-02-14 

20 <150> PRIOR APPLICATION NUMBER: FR96/01822 

21 <151> PRIOR FILING DATE: 1996-02-14 
2 3 <160> NUMBER OF SEQ ID NOS : 15 

25 <170> SOFTWARE: PatentTn Ver. 2.1 

27 <210> SEQ ID NO: 1 

2B <211> LENGTH: 291 

2 9 <2.12> TYPE: DNA 

30 <213> ORGANISM: Artificial Sequence 

32 <220> FEATURE: 

33 <223> OTHER INFORMATION: Description of Artificial Sequence: SYNTHETIC 

3 5 <220> FEATURE: 

36 <221> NAME/KEY: CDS 

37 <222> LOCATION: (!)..< 291) 
39 <4 00> SEQUENCE: 1 



4 0 


gaa 


ttc 


aac 


ate 


teg 


cag 


cac 


caa 


tgc 


gtg 


aaa 


aaa 


caa 


tgt 


ccc 


gag 


4 8 


41 


Glu 


Phe 


Asn 


lie 


ser 


Gin 


His 


Gin 


Cys 


Val 


Lys 


Lys 


Gin 


Cys 


Pro 


Glu 




4 2 


1 








5 










10 










15 






44 


aac 


ter- 


99C 


tgt 


ttc 


aga 


cac 


ttg 


gac 


gag 


aga 


gag 


gag 


tgt 


aaa 


tgt 


96 


45 


Asn 


ser. 


Gly 


Cys 


Phe 


Arg 


His 


Leu 


Asp 


Glu 


Arg 


Glu 


Glu 


cys 


Lys 


Cys 




46 








2 0 










25 










30 








48 


ctg 


ctg 


aac 


tac 


aaa 


cag 


gag 


ggc 


gac 


aag 


tgc 


gtg 


gag 


aac 


ccc 


aac 


144 


4 9 


Leu 


Leu 


Asn 


Tyr 


Lys 


Gin 


Glu 


Gly 


Asp 


Lys 


Cys 


Va 1 


Glu 


Asn 


Pro 


Asn 




50 






35 










40 










45 










52 


ccg 


acc 


tgt 


aac 


gag 


aac 


aac 


ggc 


ggc 


tgt 


gac 


gca 


gac 


gec 


aaa 


tgc 


192 


5 3 


Pro 


Thr 


cys 


Asn 


Glu 


Asn 


Asn 


Gly 


Gly 


Cys 


Asp 


Ala 


Asp 


Ala 


Lys 


Cys 




54 




50 










5 5 










60 












56 


acc 


gag 


gag 


gac 


teg 


ggc 


age 


aac 


ggc 


aag 


aaa 


ate 


a eg 


tgt 


gag 


tgt 


24 0 


57 


Thr 


Glu 


Glu 


Asp 


Ser 


Gly 


Ser 


Asn 


Gly 


Lys 


Lys 


Tie 


Thr 


Cys 


Glu 


Cys 




58 


65 










70 










75 










80 




60 


acc 


aaa 


ccc 


gac 


teg 


tac 


ccg 


ctg 


LLC 


gac 


ggc 


ate 


ttc 


tgc 


age 


taa 


288 


61 


Thr 


Lys 


Pro 


Asp 


Ser 


Tyr 


Pro 


Leu 


Phe 


Asp 


Gly 


lie 


Phe 


Cys 


Ser 






62 










85 










90 










95 






64 


Uaa 
































291 



68 <2.10> SEQ TD NO: 

69 <2U> LENGTH: 95 
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70 <212> TYPE : PRT * 
71. <2TTft\ ORGANISM: Artificial Sequence 



sequence . 



W--> 7 2(^ <220 y J! JLAIUKJL! 

12^^S> OTHER INFORMATION: Description 'of Artificial Sequence: SYNTHETIC 
74 <400> SEQUENCE: 2 



75 


Glu 


Phe 


Asa 


lie 


Ser 


Gin 


His 


Gin 


Cys 


Val 


Lys 


Lys 


Gin 


Cys 


Pro 


Gin 


76 


1 








5 










10 










15 




77 


As n 


ser 


G.ly 


Cys 


Phe 


Arg 


His 


Leu 


Asp 


Glu 


Arq 


Glu 


Glu 


Cys 


Lys 


Cys 


78 








20 










2 5 










30 






79 


Leu 


Leu 


Asn 


Tyr 


Lys 


Gin 


Glu 


Gly 


Asp 


Lys 


Cys 


Val 


Glu 


Asn 


Pro 


Asn 


80 






35 










40- 










45 








81 


Pro 


Thr 


Cys 


Asn 


Glu 


Asn 


Asn 


Gly 


Gly 


Cys 


Asp 


Ala 


Asp 


Ala 


L YS 


Cys 


82 




50 










55 










60 










83 


Thr 


G.lu 


Glu 


Asp 


Ser 


Gly 


Ser 


As n 


Gly 


Lys 


Lys 


lie 


Thr 


Cys 


Glu 


Cys 


84 


65 










70 










75 










80 


85 


Thr 


Lys 


Pro 


Asp 


Ser 


Tyr 


Pro 


Leu 


Phe 


Asp 


Gly 


lie 


Phe 


Cys 


Ser 




86 










85 










90 










95 




90 


<210> SEQ ID NO 


3 

























91 <2I1> LENGTH: 2 79 

92 <2.12> TYPE: DNA 

93 <213> ORGANISM: Plasmodium falciparum 

95 <40()> SEQUENCE: 3 

96 aacatttcac aacaccaatg cgtaaaaaaa caatgtccag aaaattctgg atgtttcaga 60 

97 catttagatg aaagagaaga atgtaaatgt ttattaaatt acaaacaaga aggtgataaa 120 

98 tgtgttgaaa atccaaatcc tacttgtaac gaaaataatg gtggatgtga tgeagatgee 180 

99 aaa tgtaceg aagaagattc aggtagcaac ggaaagaaaa tcacatgtga atgtactaaa 24 0 

100 cctgattctt atccactttt cgatggtatt ttctgcagt 279 

103 <210> SEQ ID NO: 4 

104 <211> LENGTH: 354 

105 <212> TYPE: DNA 

106 <213> ORGANISM : Artificial Sequence 

108 <220>- FEATURE: 

109 <223> OTHER INFORMATION: Description of Artificial Sequence : SYNTHETIC 
111 <220> FEATURE: 



112 <221> NAME/KEY 

113 <22 2> LOCATION 
115 <400> SEQUENCE 



CDS 

(1) . . (354 ) 
4 

116 gaa ttc aac ate teg cag cac caa tgc gtg aaa aaa caa tgt ccc gag 4 8 
.117 Glu Phe Asn lie Ser Gin His Gin Cys Val Lys Lys Gin Cys Pro Glu 
118 1 5 10 15 

120 aac tct ggc tgt ttc aga cac ttg gac gag aga gag gag tgt aaa tgt 96 

121 Asn Ser Gly Cys Phe Arg His Leu Asp Glu Arg Glu Glu Cys Lys Cys 

122 20 25 30 

124 ctg ctg aac tac aaa cag gag ggc gac aag tgc gtg gag aac ccc aac 144 

125 Leu Leu Asn Tyr Lys Gin G.lu Gly Asp Lys Cys Val Glu Asn Pro Asn 

126 3 5 4 0 4 5 

128 ccg acc tgt aac gag aac aac ggc ggc tgt gac gca gac gec aaa tgc 192 

129 Pro Thr Cys Asn G.lu Asn Asn Gl.y Gly Cys Asp Ala Asp Ala Lys Cys 

130 50 ~ 55 60 
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.1 32 


acc 


9^9 


gag 


gac 


teg 


ggc 


age aac ggc 


aag 


aaa 


ate acg tgt gag 


tg t 


24 0 


133 


Thr 


Glu 


Glu 


Asp 


Ser 


Gly 


Ser Asn Gly 


Lys 


Lys 


He Thr Cys Glu 


Cys 




134 


65 










70 






75 




80 




136 


acc 


aaa 


ccc 


gac 


teg 


tac 


ccg etg ttc 


gac 


ggc 


ate ttc tgc age 


tec 


288 


137 


Thr 


Lys 


Pro 


Asp 


Ser 


Tyr 


Pro Leu Phe 


Asp 


Gly 


lie Phe Cys Ser 


Ser 




138 










85 






90 




95 






140 


tct 


aac 


ttc 


ttg 


ggc 


ate 


teg ttc ttg 


ttg 


ate 


etc atg ttg ate 


ttg 


336 


141 


Ser 


Asa 


Phe 


Leu 


Gly 


lie 


Ser Phe Leu 


Leu 


He 


Leu Met Leu He 


Leu 




14 2 








100 






105 






110 






14 4 


tac 


age 


ttc 


att 


taa 


taa 












354 


14 5 


Tyr 


Ser 


Phe 


lie 


















14 6 






:i 15 




















149 


<210> SEQ ID NO: 


: 5 
















150 


<211> LENGTH: 116 
















151 


<212> TYPE: 


PRT 


















15 2 




!N ORGANISM : 


Artificial Sequence 












W--> 153 


K2 2 0>J FEATURE: 


















153 


\i2J 


j/ OTHER 


INFORMATION : 


: Description of 


Artificial Sequence: 


: SYNTHETIC 


1.5 5 


<400> SEQUENCE: 


5 
















156 


Glu 


Phe 


Asa 


He 


Ser 


Gin 


His Gin Cys 


Val 


Lys 


Lys Gin Cys Pro 


Glu 




157 


1 








5 






10 




15 






158 


Asa 


Ser 


Gly 


Cys 


Phe 


Arg 


His Leu Asp 


Glu 


Arg 


Glu Glu Cys Lys 


Cys 




1 59 








20 






25 






30 






160 


Leu 


Leu 


Asn 


Tyr 


Lys 


Gin 


Glu Gly Asp 


Lys 


Cys 


Val Glu Asn Pro 


Asn 




161 






3 5 








40 






45 






16 2 


Pro 


Thr 


Cys 


Asn 


Glu 


Asn 


Asn Gly Gly 


Cys 


Asp 


Ala Asp Ala Lys 


Cys 




163 




50 










55 






60 






1 6 4 


Thr 


Glu 


Glu 


Asp 


Ser 


Gly 


Ser Asn Gly 


Lys 


Lys 


lie Thr Cys Glu 


Cys 




16 5 


65 










70 






75 




80 




166 


Thr 


Lys 


Pro 


ASp 


Ser 


Tyr 


Pro Leu Phe 


ASp 


Gly 


He Phe Cys Ser 


Ser 




16 7 










85 






90 




95 






1 6 8 


Ser 


Asn 


Phe 


Leu 


Gly 


i.le 


Ser Phe Leu 


Leu 


He 


Leu Met Leu He 


Leu 




169 








100 






10 5 






110 






.170 


Tyr 


Ser 


Phe 


He 


















.1.71 






115 




















175 


<210> SEQ TC 


1 NO: 


6 
















176 


<211> LENGTH 


: : 342 
















A 77 


<212> TYPE: 


DMA 


















.178 


<213> ORGANISM: 


Plasmodium falciparum 










180 


<4 0 0> SEQUENCE: 


6 
















181 


aacatttcac aacaccaatg cgtaaaaaaa caatgtccag 


aaaattctgg atgtttcaga 


60 


182 


catttagatg aaagagaaga atgtaaatgt ttattaaatt 


acaaacaaga aggtgataaa 


120 


1H3. 


tgtgttgaaa atccaaatcc tacttgtaac gaaaataatg 


gtggatgtga tgeagatgee 


.180 


184 


aaatgtaccg aagaagat.Lc aggtagcaac ggaaagaaaa 


tcacatgtga atgtactaaa 


240 


185 


cctgattctt atccactttt cgatggtatt ttctgcagtt 


cctctaactt cttaggaata 


300 


186 


tcattcttat taatactcat gttaatatta tacagtttca 


tt 




342 


.189 


<210> SEQ ID 


' NO: 


7 . 
















190 


<2i:i 


> LENGTH 


: 3 87 
















191 


<212> TYPE: 


DNA 
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192 <213> ORGANISM: Plasmodium falciparum 

194 <22 0> FEATURE: 

195 <2 21> NAME/KEY: CDS 

196 <222> LOCATION : (1)..(387) 

193 <4 00> SEQUENCE: 7 



199 


a Lg 


aag 


9 eg 


eta 


c tc 


ttt 


tty 


ttc 


tct 


ttc 


att 


ttt 


ttc 


gtt 


acc 


aaa 


4 8 


200 


Met 


Lys 


• Ala 


Leu 


Leu 


Phe 


Leu 


Phe 


Ser 


Phe 


He 


Phe 


Phe 


Val 


Thr 


Lys 




201 


:i 








5 










10 










15 






203 


tgt 


caa 


tgt 


gaa 


aca 


gaa 


agt 


tat 


aag 


cay 


ctt 


yta 


gec 


aac 


gtg 


yac 


96 


204 


cys 


Gin 


Cys 


Glu 


Thr 


Glu 


Ser 


Tyr 


Lys 


Gin 


Leu 


Val 


Ala 


Asn 


Val 


Asp 




205 








20 










25 










30 








207 


gaa 


ttc 


aac 


ate 


teg 


cay 


cac 


caa 


tgc 


gt:g 


aaa 


aaa 


caa 


tgt 


ccc 


f jag 


144 


208 


Glu 


Phe 


Asn 


lie 


Ser 


Gin 


H is 


Gin 


Cys 


val 


Lys 


Lys 


Gin 


Cys 


Pro 


Glu 




209 






35 










4 0 










45 










211 


aac 


tct 


yqc 


tgt 


ttc 


aga 


cac 


tty 


gac 


9^g 


aga 


gag 


gag 


tgt 


aaa 


tgt 


192 


212 


Asn 


Ser 


Gly 


Cys 


Phe 


Arg 


His 


Leu 


Asp 


Glu 


Arg 


Glu 


Glu 


Cys 


Lys 


Cys 




213 




50 










55 










60 












215 


ctg 


ctg 


aac 


tac 


aaa 


cag 


gag 


ggc 


yac 


aag 


tgc 


gtg 


gag 


aac 


ccc 


aac 


240 


216 


Leu 


Leu 


Asn 


Tyr 


Lys 


Gin 


G lu 


Gly 


Asp 


Lys 


Cys 


Val 


Glu 


Asn 


Pro 


Asn 




217 


65 










70 










75 










80 




219 


ccg 


acc 


tgt 


aac 


gag 


aac 


aac 


ygg 


ggc 


tgt 


yac 


gca 


gac 


gec 


aaa 


tgc 


288 


2 20 


Pro 


Thr 


Cys 


Asn 


Glu 


Asn 


Asn 


Gly 


Gly 


Cys 


Asp 


Ala 


Asp 


Ala 


Lys 


Cys 




221 










85 










90 










95 






22 3 


acc 


gay 


gag 


gac 


teg 


ggc 


age 


aac 


ggg 


aag 


aaa 


ate 


acy 


tgt 


gag 


tgt 


336 


224 


Thr 


Glu 


Glu 


Asp 


Ser 


Gly 


Ser 


Asn 


Gly 


Lys 


Lys 


lie 


Thr 


Cys 


Glu 


Cys 




22 5 








100 










10 5 










110 








227 


acc 


aaa 


ccc 


gac 


teg 


tac 


ccg 


ctg 


ttc 


gac 


ggc 


ate 


ttc 


tgc 


age 


taa 


384 


228 


Thr 


Lys 


Pro 


Asp 


Ser 


Tyr 


Pro 


Leu 


Phe 


Asp 


Gly 


lie 


Phe 


Cys 


Ser 







229 115 120 .125 

231 taa 387 
235 <210> SEQ TD NO: 8 

23 6 <211> LENGTH: 127 
237 <212> TYPE: PRT 

2 38 <213> ORGANISM : Plasmodium falciparum 

240 <400> SEQUENCE: 8 

241 Met Lys Ala Leu Leu Phe Leu Phe Ser Phe lie Phe Phe Val Thr Lys 

24 2 1 5 10 15 

2 43 Cys Gin Cys Glu Thr Glu Ser Tyr Lys Gin Leu Val Ala Asn Val Asp 

244 20 25 30 

24 5 Glu Phe Asn lie Ser Gin His Gin Cys Val Lys Lys Gin Cys Pro Glu 

246 35 40 45 

24 7 Asn Ser Gly Cys Phe Arg His Leu Asp Glu Arg Glu Glu Cys Lys Cys 

248 50 55 60 

249 Leu Leu Asn Tyr Lys Gin Glu Gly Asp Lys Cys Val Glu Asn Pro Asn 

250 65 70 75 80 

251 Pro Thr Cys Asn Glu Asn Asn Gly Gly Cys Asp Ala Asp Ala Lys Cys 

252 8 5 90 • ' 9 5 

253 Thr Glu Glu Asp Ser Gly Ser Asn Gly Lys Lys He Thr Cys Glu Cys 

254 100 105 ' 1:10 
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255 Thr Lys Pro Asp Ser Tyr Pro Leu Phe Asp Cly lie Phe Cys Ser 

256 115 120 12 5 

260 <210> SEC I'D NO: 9 

261 <211> LENGTH : 330 

262 <212> TYPE : DNA 

263 <213> ORGANISM: Plasmodium falciparum 

265 <220> FEATURE: 

266 <221> NAME/KEY: CDS 

267 <222> LOCATION: (1) 

269 <4 00> SEQUENCE : 9 

270 gaa aca gaa agt tat 

271 Glu Thr: Glu Ser Tyr 

272 1 5 

274 ate teg cag cac caa 

275 He Ser Gin His Gin 

276 20 

278 tgt ttc aga cac ttg 

279 Cys Phe Arg His Leu 

280 35 
282 tac aaa cag gag ggc 
28 3 Tyr Lys Gin Glu Gly 
284 50 

28G aac gag aac aac ggc 
2 87 Asn Glu Asn Asa Gly 
288 65 

290 gac teg ggc age aac 
2 91 Asp Ser Gly Ser Asn 
2 92 8 5 

294 gac teg tac ccg ctg 

295 Asp Ser Tyr Pro Leu 
W--> 296 100 

299 <210> SEQ ID NO: 10 

300 <211> LENGTH: 108 

301 <212> TYPE: PRT 

302 <213> ORGANISM: Plasmodium falciparum 



304 


<4 00> SEQUENCE: 


.10 
























305 


Glu 


Thr 


Glu 


Ser 


Tyr 


Lys 


Gin 


Leu 


Val 


Ala 


Asn 


Val 


Asp 


Glu 


Phe 


Asn 


306 


1 








5 










10 










15 




307 


He 


Ser 


Gin 


H is 


Gin 


Cys 


Va 1 


Lys 


Lys 


Gin 


Cys 


Pro 


Glu 


Asn 


Ser 


Gly 


308 








20 










25 










30 






309 


Cys 


Phe 


Arg 


His 


Leu 


Asp 


Glu 


Arg 


Glu 


Glu 


Cys 


Lys 


Cys 


Leu 


Leu 


Asn 


310 






35 










40 










4 5 








311 


Tyr 


Lys 


Gin 


Glu 


Gly 


Asp 


Lys 


Cys 


Val 


Glu 


Asn 


Pro 


Asn 


Pro 


Thr 


Cys 


312 




50 










55 










60 










313 


Asn 


Glu 


Asn 


Asn 


Gly 


Gly 


Cys 


Asp 


Ala 


Asp 


Ala 


Lys 


Cys 


Thr 


Glu 


Glu 


314 


65 










70 










75 










80 


3.15 


Asp 


Ser 


Gly 


Ser 


Asn 


Gly 


Lys 


Lys 


He 


Thr 


Cys 


Glu 


Cys 


Thr 


Lys 


Pro 


316 










85 










90 










95 




3.17 


ASp 


Ser 


Tyr 


Pro 


Leu 


Phe 


Asp 


Gly 


lie 


Phe 


Cys 


Ser 











• - (330) 

aag cag ctt 
Lys Gin Leu 

tgc gtg aaa 
Cys Val Lys 

gac gag aga 
Asp Glu Arg 
40 

gac aag tgc 
Asp Lys Cys 
5 5 

gge tgt gac 
Gly Cys Asp 
70 

ggc aag aaa 
Gly Lys Lys 

ttc gac gge 
Plie Asp Gly 



gta gec aac 
Val Ala Asn 
10 

aaa caa tgt 
Lys Gin Cys 
25 

gag gag tgt 
Glu Glu Cys 

gtg gag aac 
Val Glu Asn 

gca gac gec 
Ala Asp Ala 
75 

ate acg tgt 
He Thr Cys 

■ 90 
ate ttc tgc 
He Phe Cys 
105 



gtg gac gaa. 
Val Asp Gl u 

ccc gag aac 
Pro Glu Asn 
30 

aaa tgt ctg 
Lys Cys Leu 
45 

cec aac ccg 
Pro Asn Pro 
60 

aaa tgc ace 
Lys Cys Thr 

gag tgt acc 
Glu Cys Thr 

age taa taa 
Ser 

110 



ttc aac 48. 
Phe Asn 
15 

tct ggc 96 
Ser Gly 

ctg aac 14 4 
Leu Asn 

acc tgt 192 
Thr Cys 

gag gay 24 0 
Glu Glu 
80 

aaa ccc 288 
Lys Pro 
95 

330 



file://C:\CRF3\Outhold\Vsiil 2503 1 B.htm 



1/8/01 



Page 6 of 7 



VERIFICATION SUMMARY DATE: 01/08/2001 

PATENT APPLICATION: US/09/125, 031B TIME: 15:01:48 

input Set : A:\6 60139.app 

Output Set: N:\CRF3\01082001\I125031B.raw 

L:72 M:258 W: Mandatory Feature missing, <220> FEATURE: 

L : 15 3 M : 2 5 8 W : Ma n d a tory Pea t u re m Is sing, <220> F E ATU RE : 

L:29G M:336 W: Invalid Amino Acid Number in Coding Region, SEQ ID: 9 
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